An Assessment of Cyc for Natural Language Processing
Abstract
This is the final report on the assessment of Cyc for natural language processing applications. The work reported here was carried out by the authors at CRL, NMSU in collaboration with both the Department of Defense and Cycorp, Inc. The primary motivation of this relatively small-scale exercise was to arrive at an independent assessment of the utility of Cyc’s knowledge and inference capabilities for solving difficult problems in NLP and machine translation. Word sense disambiguation and coreference resolution were chosen as the two problems for this study. We conclude from this exercise that Cyc in fact has a large amount of knowledge that is potentially useful for solving these problems in NLP. However, the knowledge in Cyc is not directly applicable to the problems either in an exclusively Cyc-based solution or one where Cyc is used to improve the performance of other methods. In this report, we have attempted to identify the primary reasons why Cyc cannot readily solve NLP problems, to illustrate our findings with many real-world examples, and to suggest changes or enhancements to Cyc that might make its knowledge more readily applicable to NLP problems.
Similar resources
The Polish Cyc lexicon as a bridge between Polish language and the Semantic Web
In this paper we discuss the problem of building the Polish lexicon for the Cyc ontology. As the ontology is very large and complex we describe semi-automatic translation of part of it, which might be useful for tasks lying on the border between the fields of Semantic Web and Natural Language Processing. We concentrate on precise identification of lexemes, which is crucial for tasks such as nat...
Dynamic Mediation for Removing Language Comprehension Problems: A Psychological Support for Listening Comprehension Mental Processing
Dynamic Assessment is an approach to assessment within Applied Linguistics that stems from Vygotsky’s Socio-Cultural Theory of mind, his concept of the Zone of Proximal Development, and Feuerstein's theory of Structural Cognitive Modifiability. This study is an attempt to pinpoint the sources of mental processing problems in listening comprehension and applies dynamic interventions to remove t...
Information Extraction as a Stepping Stone toward Story Understanding
Historically, story understanding systems have depended on a great deal of hand-crafted knowledge. Natural language understanding systems that use conceptual knowledge structures [Schank and Abelson, 1977; Cullingford, 1978; Wilensky, 1978; Carbonell, 1979; Lehnert, 1981; Kolodner, 1983] typically rely on enormous amounts of manual knowledge engineering. While much of the work on conceptual kno...
Wikipedia-based Semantic Interpretation for Natural Language Processing
Adequate representation of natural language semantics requires access to vast amounts of common sense and domain-specific world knowledge. Prior work in the field was based on purely statistical techniques that did not make use of background knowledge, on limited lexicographic knowledge bases such as WordNet, or on huge manual efforts such as the CYC project. Here we propose a novel method, cal...
Computing semantic relatedness of words and texts in Wikipedia-derived semantic space
Adequate representation of natural language semantics requires access to vast amounts of common sense and domain-specific world knowledge. Prior work in the field was either based on purely statistical techniques that did not make use of background knowledge or on huge manual efforts, such as the CYC projects. Here we propose a novel method, called Explicit Semantic Analysis (ESA), for finegrai...